WinPitch Corpus, a Text to Speech Alignment Tool for Multimodal Corpora

Author

  • Philippe Martin
Abstract

WinPitch Corpus is an innovative software program for computer-aided alignment of large corpora. It provides a method for easy and precise selection of alignment units, ranging from syllables to whole sentences, within a hierarchical storage system for aligned data. The method relies on the user's ability to visually link a text segment to the perception of the corresponding speech sound, played back at slower speed, and to select that segment with a mouse click. Clicking on a text segment generates bidirectional speech-text pointers that define the alignment. Compared with emerging automatic processes, this method has the advantage of remaining effective even for poor-quality speech recordings or when speakers' voices overlap. A recent version of the software handles multimedia files and can display the corresponding video streams at slower speed.
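The click-to-align idea can be illustrated with a minimal Python sketch. It is not WinPitch Corpus code: the class name `AlignmentLayer`, the `speed` parameter, and the convention that each click marks the start of a unit are all assumptions made for illustration. It shows one alignment layer in which a click during slowed playback is mapped back to a time in the recording, and the stored pair can then be looked up in both directions (text to time, time to text).

```python
from bisect import bisect_right
from dataclasses import dataclass, field


@dataclass
class AlignmentLayer:
    """One hypothetical layer of alignment units (syllables, words, sentences...)."""
    starts: list = field(default_factory=list)  # unit start times in seconds, kept sorted
    units: list = field(default_factory=list)   # text of each unit, parallel to `starts`

    def click(self, text, playback_time, speed=1.0):
        """Record a click on `text` made `playback_time` seconds into slowed playback.

        `speed` is the playback rate relative to the recording (0.5 = half speed),
        so the position in the original recording is playback_time * speed.
        Assumption: the click marks the start of the unit.
        """
        original_time = playback_time * speed
        i = bisect_right(self.starts, original_time)
        self.starts.insert(i, original_time)
        self.units.insert(i, text)
        return original_time

    def time_of(self, index):
        """Text-to-speech pointer: start time of the unit at `index`."""
        return self.starts[index]

    def unit_at(self, time):
        """Speech-to-text pointer: the unit whose start is the latest one at or before `time`."""
        i = bisect_right(self.starts, time) - 1
        return self.units[i] if i >= 0 else None


# Usage: two clicks made while listening at half speed.
layer = AlignmentLayer()
layer.click("hello", 1.0, speed=0.5)
layer.click("world", 2.4, speed=0.5)
```

A hierarchy of the kind the abstract mentions could be modeled as several such layers keyed by unit type, for example a dictionary mapping "syllable" and "sentence" to separate `AlignmentLayer` instances; how WinPitch Corpus actually stores its layers is not described here.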

Similar resources

WinPitch: A Multimodal Tool for Speech Analysis of Endangered Languages

WinPitch is a speech analysis program running on PC and Mac personal computers for acoustical analysis of speech corpora. It includes a large number of specialized functions to transcribe, align and analyze large sound and video recordings. It supports multiple hierarchical layers for segmentation (up to 96 layers), speaker lists, and overlapping speech. Various character encodings, including U...

Winpitch Corpus, a Software Tool for Alignment and Analysis of Large Corpora

Description of endangered languages normally starts with the collection of speech data, which are then segmented into various phonological, prosodic, morphological and syntactic units. In this process, the (phonetic) transcription is the most critical part, and user-friendly tools are essential to tackle any sizeable work in a reasonable amount of time. The software program WinPitch Corpus add...

New functions for a multipurpose multimodal tool for phonetic and linguistic analysis of very large speech corpora

The increased interest in linguistic analysis of spontaneous (i.e. non-prepared) speech from various points of view (semantic, syntactic, morphologic, phonologic and intonative) has led to the development of ever more sophisticated dedicated tools. Although the software Praat emerged as the de facto standard for the analysis of spoken data, its use for intonation studies is often felt as not opti...

Evaluating Factors Impacting the Accuracy of Forced Alignments in a Multimodal Corpus

People, when processing human-to-human communication, utilize everything they can in order to understand that communication, including speech and information such as the time and location of an interlocutor’s gesture and gaze. Speech and gesture are known to exhibit a synchronous relationship in human communication; however, the precise nature of that relationship requires further investigation...

BECAM tool - a semi-automatic tool for bootstrapping emotion corpus annotation and management

Corpus annotation is an important aspect in speech applications where stochastic models need to be trained and evaluated. Multimodal corpora are also annotated. Moreover, corpus annotation is an essential phase in the construction of emotion recognizer engines. Large corpora, as they are essential to construct representative knowledge bases, have been a problem for corpus annotators. Time consu...

Publication date: 2004